Ligation-based synthesis of oligonucleotides with block structure

The present invention relates to a method of producing single-stranded nucleic 
acid molecules from oligo- or polynucleotides wherein each of said oligo- or 
polynucleotides has a predefined 5' or 3' terminus, comprising the steps of (a) 
annealing an adaptor oligonucleotide simultaneously or step by step to (aa) a 
first oligo- or polynucleotide; and (ab) a second oligo- or polynucleotide wherein 
the 5'-terminus of said adaptor oligonucleotide is complementary in sequence to 
the 5' terminus of said first oligo- or polynucleotide and the 3'-terminus of said 
adaptor molecule is complementary in sequence to the 3' terminus of said 
second oligo- or polynucleotide; and optionally (a') simultaneously with or 
subsequently to step (a) annealing at least one further adaptor oligonucleotide to 
free termini of said first or second oligonucleotides and to free termini of further 
oligo- or polynucleotides; (b) optionally filling in gaps between the neighbouring 
ends of said oligo- or polynucleotides; (c) ligating said oligo- or polynucleotides; 
and (d) removing said at least one adaptor oligonucleotide. In a preferred 
embodiment of the method of the invention, said single-stranded nucleic acid 
molecules represent a collection of nucleic acid molecules wherein either said 
first or said second oligo- or polynucleotide is invariable in sequence between all 
members of said collection of nucleic acid molecules. 



The invention is particularly efficient for the synthesis of long polynucleotides or 
sets of oligonucleotides with block structure. 

As is known in the art, oligonucleotide-producing companies cannot guarantee a
quantitative yield for oligos longer than 100 nucleotides (nt). Moreover, the yield
and quality of the synthesis decrease dramatically when the length of the
oligonucleotide is more than 60 nt. The smallest possible scale of synthesis for
80-100mers is more than 200 nmol. Two-step purification (HPLC and PAGE) is
required to obtain single-band oligonucleotides. The guaranteed output of
purified 80-100mers is less than 1 nmol and the price is about 200-300 Euro.

Oligonucleotides with block structure are widely used in molecular biology
(Figure 1). Examples are: padlock probes (Lizardi et al. 1998; Pickering et al.
2002); primers with constant 5' regions, used for multiplex PCR amplification
(Favis et al. 2000; Lindblad-Toh et al. 2000), ligase-independent cloning (de
Costa and Tanuri 1998; Rashtchian et al. 1992; Zhou and Hatahet 1995; and
commercial kits from Novagen, Invitrogen, BD Biosciences) and Invader assay
(Mein et al. 2000).

Normally, these primers are synthesized by phosphoramidite technology and 
common regions have to be synthesized again and again in different 
oligonucleotides. This is expensive, especially when the common part contains a
hapten or fluorophore.

This problem becomes evident, for example, in the preparation of sets of padlock 
probes for SNP-detection projects. Padlock probes are typically 90-120 nt long
oligonucleotides, which consist of two locus-specific regions at the 3' and 5'
ends connected by a universal linker part (Figure 1A). The high price and the low
yield of synthesis are the main obstacles to the routine use of padlock probes.
Though they have been shown to be an excellent tool for SNP detection and in situ
localization, only a few laboratories have worked with padlocks so far. Accordingly, the
technical problem underlying the present invention was to provide methods for 
the quantitative and cost-sensitive production of single-stranded nucleic acid 
molecules that can in particular be employed as padlock probes. 



The solution to said technical problem is achieved by providing the embodiments 
characterized in the claims. Thus, the present invention relates to a method of 
producing single-stranded nucleic acid molecules from oligo- or polynucleotides 
wherein each of said oligo- or polynucleotides has a predefined 5' or 3' terminus, 
comprising the steps of (a) annealing an adaptor oligonucleotide simultaneously 
or step by step to (aa) a first oligo- or polynucleotide; and (ab) a second oligo- or 
polynucleotide wherein the 5'-terminus of said adaptor oligonucleotide is
complementary in sequence to the 5' terminus of said first oligo- or
polynucleotide and the 3'-terminus of said adaptor molecule is complementary in
sequence to the 3' terminus of said second oligo- or polynucleotide; and
optionally (a') simultaneously with or subsequently to step (a) annealing at least
one further adaptor oligonucleotide to free termini of said first or second
oligonucleotides and to free termini of further oligo- or polynucleotides; (b)
optionally filling in gaps between the neighboring ends of said oligo- or 
polynucleotides; (c) ligating said oligo- or polynucleotides; and (d) removing said 
at least one adaptor oligonucleotide. 

In accordance with the present invention, the term "oligonucleotide" refers to a
uni-dimensional (i.e. not branched) stretch of nucleotides, preferably
deoxyribonucleotides, of up to 30 nucleotides. The term also comprises
oligonucleotides comprising or consisting totally of ribonucleotides. Also
envisaged is that the oligonucleotides comprise unusual nucleotides such as,
for example, deoxyuridine, biotinylated or fluorescently
labeled nucleotides, spacers or abasic residues. It is preferred that the
oligonucleotide employed in the method of the invention consists of the four
naturally occurring deoxyribonucleotides, i.e. those containing adenine,
cytosine, guanine and thymine.

The term "polynucleotide" in accordance with the invention may consist of the
same types of nucleotides that are described above for oligonucleotides.
However, a polynucleotide in accordance with the invention comprises a
unidimensional stretch of at least 31 nucleotides.

The term "5'-terminus" refers to the 5'-terminal part of an oligo- or polynucleotide,
preferably the terminal 5 or 4 nucleotides.



The term "complementary in sequence" refers to complementarity in sequence of
at least 75% of the respective nucleotides, preferably at least 90% of the
respective nucleotides and most preferred 100% of the respective nucleotides.
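
The percentage criterion defined above can be illustrated with a short sketch. It is not part of the described method; the function names, the restriction to plain A/C/G/T input and the example sequences are assumptions made purely for illustration.

```python
# Watson-Crick partners for DNA bases (assumption: plain A/C/G/T input only).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def percent_complementarity(strand: str, partner: str) -> float:
    """Percentage of positions at which `partner` base-pairs with `strand`.

    Both sequences are written 5'->3' and must have the same length.
    `partner` is read in reverse because the two strands anneal antiparallel.
    """
    strand, partner = strand.upper(), partner.upper()
    if not strand or len(strand) != len(partner):
        raise ValueError("sequences must be non-empty and of equal length")
    paired = sum(COMPLEMENT[a] == b for a, b in zip(strand, reversed(partner)))
    return 100.0 * paired / len(strand)


def is_complementary(strand: str, partner: str, threshold: float = 75.0) -> bool:
    """Apply the 'at least 75%' criterion; pass 90.0 or 100.0 for the
    preferred and most preferred cases."""
    return percent_complementarity(strand, partner) >= threshold
```

With the hypothetical 5-nt terminus `ACGTA`, the partner `TACGT` pairs at every position, whereas `TACGA` pairs at four of five positions and therefore satisfies the 75% criterion but not the 90% one.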

In accordance with the present invention, a novel method of producing 
oligonucleotides by ligation of individual fragments by a ligase such as T4 DNA 
ligase is described. It is simple and allows the simultaneous processing of 
several reactions. The method is quantitative, cheap and does not require 
individual optimization. The possibility to purify products by HPLC makes the
technology suitable for large-scale genomic projects. On the other hand, the
same approach may be used for small-scale synthesis of composite primers for
two-step PCR amplification and ligation-independent cloning. Small-scale
reactions do not require any purification.

The method of the invention requires oligo- or polynucleotides having a
predefined 5' or 3' terminus to which an adaptor oligonucleotide, which is
complementary in sequence to said predefined 5' or 3' terminus, is annealed.
Simultaneously or subsequently, the second oligo- or polynucleotide is annealed
to said adaptor oligonucleotide by way of complementarity of its 5' or 3' terminus.
A schematic overview of the annealing process is provided in Fig. 2B wherein
the first oligo- or polynucleotide may be represented by #R, the second oligo- or
polynucleotide may be represented by #C and the adaptor oligonucleotide may
be represented by #aR. Alternatively, the first oligonucleotide or polynucleotide
may be represented by #C, the second oligonucleotide or polynucleotide may be
represented by #L and the adaptor oligonucleotide may be represented by #aL.
The situation including optional step (a') is represented by the complete
arrangement of oligonucleotides depicted in Fig. 2B. For example, if #R
represents the first oligo- or polynucleotide and #C represents the second oligo-
or polynucleotide and #aR represents the adaptor oligonucleotide, then #aL
represents the further adaptor oligonucleotide and #L represents the further
oligo- or polynucleotide.
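
The geometry of the annealing step can be sketched in code. The following is a minimal illustration, not the procedure actually used to design the primers of Table 1: the function names, the default overlap lengths and the example sequences are hypothetical. It assumes that the adaptor is simply the reverse complement of the junction formed by the 3' terminus of the upstream fragment and the 5' terminus of the downstream fragment, so that no gap remains to be filled in.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_adaptor(upstream: str, downstream: str,
                   upstream_overlap: int = 5,
                   downstream_overlap: int = 5) -> str:
    """Return an adaptor (5'->3') that bridges two fragments for ligation.

    `upstream` supplies the 3' terminus of the junction and `downstream`
    the 5' terminus. The 5' part of the adaptor pairs with the 5' terminus
    of `downstream`; its 3' part pairs with the 3' terminus of `upstream`.
    """
    if not 1 <= upstream_overlap <= len(upstream):
        raise ValueError("upstream_overlap out of range")
    if not 1 <= downstream_overlap <= len(downstream):
        raise ValueError("downstream_overlap out of range")
    junction = upstream[-upstream_overlap:] + downstream[:downstream_overlap]
    return reverse_complement(junction)


# Hypothetical fragments, chosen only to show the orientation.
upstream = "ACGTTGCAAGGCTA"
downstream = "CCATGGTTAACGAT"
adaptor = design_adaptor(upstream, downstream)
```

For these example fragments the adaptor is `CATGGTAGCC`; its reverse complement, `GGCTACCATG`, spans the junction of the ligated product. Unequal overlaps (for instance a longer segment on the common fragment and a 5-nt overhang for the variable fragment) can be obtained through the two overlap parameters.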

If gaps are obtained after annealing of the adaptor oligonucleotide(s) to said first, 
second and optionally further oligo- or polynucleotide(s) then the gaps are filled 
in, for example, by polymerase activity such as T4 DNA polymerase activity. 
Subsequently, the at least two oligo- or polynucleotides are ligated using an 
appropriate ligase. Appropriate ligases depend, inter alia, on the nature of the 
oligo- or polynucleotides used for preparation of the single-stranded nucleic acid 
molecules. For example, if the oligo- or polynucleotides are DNA, then it is
preferred to use the T4 DNA ligase. Other ligases may also be used, for example
thermostable commercially available Tth, Taq or Pfu ligases. Another possibility
to perform ligation is a chemical template-dependent reaction (Xu and Kool
1999), which uses chemically activated oligonucleotides instead of an enzyme.

Finally, the at least one adaptor oligonucleotide is removed. Removal can be
effected by denaturing PAGE or chromatography, such as FPLC or HPLC.
Other methods are known in the prior art comprising capture of biotin-labeled
adaptors or destruction of ribonucleotide adaptors by RNase (Nilsson et al.
2000).

The above steps performed in accordance with the present invention per se can 
be effected by the person skilled in the art according to conventional protocols 
such as are provided in the appended examples. Temperature ranges include 4 
to 42°C for annealing, fill-in reactions and ligation.
Reaction buffers include conventional reaction buffers such as disclosed, for
example, in (Sambrook and Russell 2001).

Preferred is a method of the invention wherein the complementarity in sequence 
is at least four nucleotides such as 5, 6, 7, 8, 9 or 10 nucleotides. It is particularly 
preferred that the number of nucleotides which are complementary in sequence 
is five nucleotides. Also particularly preferred is that there is no mismatch within 
the stretch of complementarity.

In another preferred embodiment the invention relates to a method wherein 
annealing and ligation are simultaneously performed. Buffers can easily be 
adjusted to have annealing and ligation performed simultaneously. If these steps
are performed simultaneously, then it is preferred that optional step (b) is 
omitted. The method of the invention can in this way be accelerated. 

It is also preferred in accordance with the method of the present invention that
the most valuable oligo- or polynucleotide in step (a) and/or (a') is provided in
molar deficit relative to the other oligo- and polynucleotides. The molar deficiency
will guarantee that said oligo- or polynucleotide is consumed in the ligation reaction.
The term "most valuable oligo- or polynucleotide" with respect to the present
invention refers to either (i) the most expensive oligonucleotide
(labeled by hapten or fluorophore or the longest oligonucleotide) or (ii) the
oligonucleotide available in a smaller quantity than the others.

In another preferred embodiment, the present invention relates to a method, 
wherein said single-stranded nucleic acid molecules represent a collection of 
nucleic acid molecules and wherein either said first or said second oligo- or
polynucleotide is invariable in sequence between all or essentially all members
of said collection of nucleic acid molecules.

The term "essentially all members" refers to at least 90%, preferably at least
95%, more preferred at least 98% and most preferred to at least 99% such as
99.5% or 99.8% of all members.

This advantageous embodiment of the invention relates to, in other terms, a 
method of producing a collection of single-stranded nucleic acid molecules 
wherein each member of said collection of nucleic acid molecules comprises a 
portion that is invariable between all or essentially all members of said collection 
and at least one portion that is variable between different members of said 
collection and that is located 5' or 3' of said invariable portion, comprising the 
steps of (a) annealing at least one adaptor oligonucleotide simultaneously or
step by step to (aa) an oligo- or polynucleotide representing said invariable
portion; and (ab) oligo- or polynucleotides representing said variable portions, 
wherein (i) a first part of said at least one adaptor oligonucleotide is 
complementary in sequence to the 5' terminus of said nucleic acid molecule 
representing said invariable portion and a second part of the at least one adaptor 
molecule is complementary in sequence to the 3' terminus of a nucleic acid 
molecule representing said variable portion; or (ii) a first part of said at least one 
adaptor oligonucleotide is complementary in sequence to the 3' terminus of said 
nucleic acid molecule representing said invariable portion and a second part of 
the at least one adaptor molecule is complementary in sequence to the 5' 
terminus of a nucleic acid molecule representing said variable portion; (b) 
optionally filling in gaps between the neighbouring ends of said invariable and 
said variable portions; (c) ligating the invariable and variable portions; and (d) 
removing said at least one adaptor oligonucleotide. In accordance with this 
preferred embodiment, it is further particularly preferred that said nucleic acid 
molecule representing said invariable portion is annealed with two adapter 
oligonucleotides, wherein further one of said adapter oligonucleotides is in a first 
part complementary in sequence with the 5' end of said nucleic acid molecule 
representing said invariable portion and the second adapter oligonucleotide is in 
a first part complementary to the 3' end of said nucleic acid molecule
representing said invariable portion. In this embodiment, the respective termini of
the adaptor oligonucleotides not annealed to said invariable portion are annealed
to termini of oligo- or polynucleotides representing variable portions of the single- 
stranded nucleic acid molecule. A schematic overview of such an arrangement is 
provided in Fig. 2B. 

This embodiment of the method of the invention is particularly advantageous for
the cost-sensitive and easy production of, for example, padlock probes. It is also
advantageous to use the resulting single-stranded nucleic acid molecules in two-step
PCR or ligase-independent cloning as will be discussed further below.

In principle, the nucleic acid molecules representing the variable portions may
have at least one conserved terminus, namely the terminus that anneals to the
adaptor oligonucleotide. In this case adaptor oligonucleotides may be essentially
the same for the whole collection. Alternatively, the oligo- or polynucleotides
representing variable portions may be without any conserved parts. Then a
special adaptor oligonucleotide should be used for annealing of each particular
nucleic acid molecule representing the variable portion. The terminus of said
special adaptor oligonucleotide not annealed to the nucleic acid molecules
representing the invariable portion must be predefined in order to allow a
successful annealing reaction.

Most preferred is a method of the invention wherein the further oligo- or 
polynucleotides are variable in sequence between different members of said 
collection of nucleic acid molecules. 

In accordance with the embodiments pertaining to the production of a collection 
of single-stranded nucleic acid molecules, it is finally preferred that the 5' or 3' 
termini of said oligo- or polynucleotides representing said variable sequences 
which anneal to said 5' or 3' termini of said adaptor oligonucleotide are invariable
between different members of said oligo- or polynucleotides representing said
variable sequences.

In another preferred embodiment of the method of the invention, ligation is 
effected with T4 DNA ligase. Most preferred is that about 1 unit of T4 DNA ligase
is reacted in step (c) with about 4 pmol of termini of the oligo- or polynucleotides
annealed to said adaptor molecule(s). It is also preferred in this embodiment that
the ligation reaction is carried out at a temperature of about 20°C. Ligation
efficiency may be significantly increased if the reaction is carried out at, for
example, 37°C, the temperature optimum for T4 DNA ligase. At this temperature,
it is required that the complementary sequences comprise 5 or more nucleotides.

Further preferred is a method wherein the ligation reaction is carried out in the
presence of a molecular crowding agent, for example, with at least 5%
polyethylene glycol. The inclusion of polyethylene glycol above the indicated 
range is advantageous because it increases the ligation efficiency. 

In accordance with this preferred embodiment, it is more preferred that the 
ligation reaction is carried out in the presence of 12 to 18% polyethylene glycol. It 
is particularly preferred that the ligation reaction is carried out in the presence of 
about 15% polyethylene glycol. 

Preferred in accordance with the method of the invention is further that said 
polyethylene glycol is polyethylene glycol 6000. 

In an additional preferred embodiment of the present invention, the method 
further comprises the step of purifying said single-stranded nucleic acid 
molecules. Purification can be performed according to standard protocols, see for 
example Sambrook, J., D. Russell. 2001. Molecular cloning: A laboratory manual. 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
Purification advantageously includes PAGE (Polyacrylamide Gel
Electrophoresis) preferably under denaturing conditions, FPLC or HPLC or
chromatography. Also preferred is an embodiment of the method of the invention
further comprising modifying at least one of said oligo- or polynucleotides. In the
alternative of the aforementioned embodiment, at least one of said oligo- or 
polynucleotides is modified when added to the reaction, i.e. the first step of the 
method of the invention. 

In accordance with this preferred embodiment of the invention, the modification
of the at least one oligo- or polynucleotide may be effected during one step of the
method of the invention, for example when performing the fill-in reaction. 
Alternatively, a pre-modified oligonucleotide or polynucleotide may be included in 
the steps of the method of the invention. Modifications may be manifold and 
include the modifications recited herein below as being preferred. 

Advantageously, the modification is a ribonucleotide, a spacer or a nucleotide 
comprising a detectable label. 

Detectable labels include bioluminescent, phosphorescent, biotinylated,
fluorescent and radioactive labels such as labels with 32P or 3H.

In a particularly advantageous embodiment of the method, said oligo- or 
polynucleotides representing the invariable sequence are modified. 

It is also preferred to automate the method of the invention. The ligation reaction
may be assembled by an automated liquid-handling system. It is additionally
preferred that the final product is purified by HPLC.

In an additional preferred embodiment of the method of the present 
invention, said method further comprises employing members of said collection 
of nucleic acid molecules in ligase-independent cloning (LIC). Composite primers 
for LIC have gene-specific 3'-parts and special 5'-parts (LIC system, Novagen;
In-Fusion PCR cloning, BD Biosciences Clontech; Gateway PCR cloning system,
Invitrogen).



The figures show: 

Figure 1: Applications of oligonucleotides with block structure. Gene-specific
parts are white, common parts are black. A. Template-dependent ligation of
padlock probe. B. Primers for multiplex PCR amplification and ligation-
independent cloning. C. Invader assay.

Figure 2: Padlock synthesis. A. Step by step PAGE analysis of padlock
synthesis. Bands were visualized by UV shadowing as described in Methods.
Lane 1 - unligated primers; lane 2 - result of ligation; lane 3 - PAGE-purified
padlock probe. Aliquots of the same reactions were taken for this gel. B. Scheme
of ligation. C. PCR-based approach for padlock synthesis (Antson et al. 2000;
Myer and Day 2001). Amplification is performed with two gene-specific primers
having long 3' overhangs (#PCR_L and #PCR_R) on template #PCR_C. Single-
stranded padlock is purified after annealing to streptavidin paramagnetic
particles.

Figure 3: Ligation of different primers with the same 4 nt overhangs. Ligation of
[γ-32P]ATP-labeled #top and #bot primers with (#1; #a1) and (#2; #a2); see
scheme under Table 1. Ligation was performed as described in Methods. Lanes
1-8: #bot primer; lanes 9-16: #top primer. Lanes (1-7) and (9-15) correspond to
sequential two-fold dilutions of T4 DNA ligase (400 u for lanes 1 and 9).
Lanes 8 and 16 - control without ligase. A. Ligation with #1 and #a1. B. Ligation
with #2 and #a2.

Figure 4: Application of ligated primers. A. Padlock probe circularizes only in the
presence of perfectly matched template. Circular products have decreased
mobility in PAGE compared with linear ones. Lane 1 - control without ligase; 2 -
ligation on #T1 (perfectly matched) template; 3 - ligation on #T2 (mismatched)
template. B. PCR amplification of the ORF of phi29 polymerase (1.7 kb) with
composite and gene-specific primers. Lanes 1-5 - composite primers (#1&#top
and #2&#bot); lanes 6-10 - gene-specific primers (#top and #bot). Lanes 1 and 6
- after 12 cycles; lanes 2 and 7 - after 16 cycles; lanes 3 and 8 - after 20 cycles;
lanes 4 and 9 - after 24 cycles; lanes 5 and 10 - after 28 cycles. M - marker.



MATERIALS AND METHODS 

Oligonucleotides were synthesized by TIB Molbiol (Berlin, Germany). Primer
sequences are given in Table 1. T4 DNA ligase and T4 PNK were from New
England BioLabs (Beverly, USA). Tth ligase was from ABgene (UK).

Polyacrylamide gels with radiolabeled oligonucleotides were exposed on a Fuji
Imaging Plate without any fixation. For long exposures (more than a couple of
hours) the cassette with the gel was frozen at -20°C. Freezing prevents diffusion of
even 4 nt long oligonucleotides.

Padlock probes. 

Phosphorylation of #C and #L oligonucleotides was performed at 37°C for 1 hour.
1 nmol of primer was incubated in 10 µl of T4 PNK buffer (Tris-HCl, pH 7.6, 70 mM;
MgCl2 10 mM; DTT 5 mM) with 1 mM ATP and 2.5 U of T4 PNK followed by enzyme
heat inactivation at 65°C for 20 minutes. Phosphorylated primers were used in
ligation reactions without purification.

The scheme of the ligation-based synthesis of padlock probe is shown on Figure
2B. A 200 pmol-scale ligation reaction was performed for 1 hour at 20°C in 20 µl of
mixture: 1x T4 ligase buffer (Tris-HCl, pH 7.5, 50 mM; MgCl2 10 mM; DTT 10 mM;
ATP 1 mM; BSA 25 µg/ml); PEG 6000 15%; T4 DNA ligase 100 u. To ensure that
all #C oligonucleotides are consumed in annealing and ligation reactions,
adaptors and locus-specific primers were taken in a slight excess relative to the
common primer: #C - 200 pmol; #aR = #aL - 220 pmol; #R = #L - 240 pmol
(1 : 1.1 : 1.2). In some experiments adaptors were annealed to the common primer
before addition of enzyme and locus-specific primers (heating the mixture to
90°C and cooling gradually to normal temperature), but this measure is not
essential for the method of the invention.

Padlock probes may be phosphorylated directly in the ligation mixture after heat
inactivation of T4 DNA ligase (e.g. 65°C for 15 minutes).

Padlocks were purified by denaturing PAGE. The
corresponding band was visualized by UV shadowing on a DC Alufolien
Kieselgel 60 F254 (Merck, Germany) chromatographic plate (or on printer paper
with a somewhat lower sensitivity) and was cut out. DNA was ethanol
precipitated after elution from the gel in 150 µl of (Tris-HCl, pH 7.5, 10 mM; EDTA
1 mM; NaCl 200 mM) for 1 hour at 60°C.

Circularization of 40 fmol of a [γ-32P]ATP-labeled padlock probe on 2 fmol of
matched or mismatched synthetic template was performed in 10 µl of 1x Tth
ligation buffer (Tris-HCl, pH 8.3, 20 mM; MgCl2 10 mM; KCl 50 mM; EDTA 1 mM;
NAD+ 1 mM; DTT 10 mM; Triton X-100 0.1%) by 1 u of Tth ligase for 20 cycles of
(94°C for 20 sec and 60°C for 3 min).



Composite primers for PCR.

Ligation was performed separately for (#top, #1, #a1) and (#bot, #2, #a2) primer
sets: 1 hour at 20°C and 20 min at 70°C in 10 µl of mixture: 1x T4 ligase buffer;
PEG 6000 15%; T4 DNA ligase 400 u; phosphorylated #top or #bot primers -
20 pmol; #a1 or #a2 - 25 pmol; #1 or #2 - 30 pmol. Composite primers were used
in PCR amplification without any purification.

Amplification was performed in 50 µl with Advantage cDNA Polymerase Mix
(Clontech, USA). 1 µl of phage phi29 suspension (5 x 10^10 per ml; DSMZ, Germany)
was used as a template. Parameters of PCR with #top and #bot primers were:
10 pmol of both primers; 96°C 2 min, (95°C 20 sec, 62°C 20 sec, 68°C
1 min 20 sec) x 28 cycles. PCR with composite primers: 1 pmol of both composite
primers, 10 pmol of #1 and #2 primers; 96°C 2 min, (95°C 20 sec, 62°C 20 sec,
68°C 1 min 20 sec) x 5 cycles, (95°C 20 sec, 58°C 20 sec, 68°C 1 min 20 sec) x 23
cycles. Two different annealing temperatures were used for amplification with
composite primers because the melting temperature of external primer #1 (59.3°C)
was lower than that of internal primers #top and #bot (65°C).
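
The two cycling programmes described above can be written down as data, which makes it easy to check that both run for the same total number of cycles. This is an illustrative sketch only: the class and variable names are hypothetical, only the cycling stages and the 2 min initial denaturation are modelled, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """A block of identical three-step cycles (temperatures in deg C, times in s)."""
    cycles: int
    anneal_c: int
    denature_c: int = 95
    extend_c: int = 68
    denature_s: int = 20
    anneal_s: int = 20
    extend_s: int = 80  # 1 min 20 sec

    def hold_seconds(self) -> int:
        """Sum of programmed hold times for this stage."""
        return self.cycles * (self.denature_s + self.anneal_s + self.extend_s)


INITIAL_DENATURATION_S = 120  # 2 min before the first cycle

# Gene-specific primers (#top and #bot): one stage at 62 deg C annealing.
GENE_SPECIFIC = [Stage(cycles=28, anneal_c=62)]
# Composite primers: 5 cycles at 62 deg C, then 23 cycles at 58 deg C.
COMPOSITE = [Stage(cycles=5, anneal_c=62), Stage(cycles=23, anneal_c=58)]


def total_cycles(program: list[Stage]) -> int:
    return sum(stage.cycles for stage in program)


def hold_minutes(program: list[Stage]) -> float:
    """Programmed hold time in minutes, excluding temperature ramps."""
    seconds = INITIAL_DENATURATION_S + sum(s.hold_seconds() for s in program)
    return seconds / 60
```

Both programmes comprise 28 cycles in total, so that samples taken after the same number of cycles can be compared directly between the two primer types.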

In this specification, a number of documents are cited. The disclosure content of
these documents, including manufacturers' manuals, is herewith incorporated by
reference in its entirety.

The examples illustrate the invention.

Example 1: Padlock synthesis.

Padlock probes (padlocks) are typically 90-120 nt oligonucleotides, which consist
of two locus-specific regions at the 3' and 5' ends connected by a universal linker
part (Figure 1A). The ability of padlocks for template-dependent circularization is
used for in situ localization and SNP detection (Antson et al. 2000; Lizardi et al. 
1998; Myer and Day 2001; Pickering et al. 2002). High quality of locus-specific 
ends is important in ligation reaction, because they should create a nick with 
perfect base pairing. 

The scheme of the ligation-based synthesis of padlocks is shown on Figure 2B. 
Adaptor primers #aL and #aR and central part primer #C (all shown black) are 
common for the whole set of padlocks. Primers #R and #L (shown white) are 
locus-specific. 5nt 3' and 5' overhangs of adaptor primers serve for ligation of 
locus-specific primers. A small excess of adapters (1.1x) and locus-specific
primers (1.2x) guarantees that all #C oligonucleotides will be consumed in ligation.
The scheme is practically the same for synthesis of oligos with common 3' or 5' 
regions (just one adaptor and one locus-specific primer instead of two). 

Melting temperatures of overlaps of adaptor primers with common primer are 
about 50°C in the ligation mixture. In principle, it is possible to use shorter
#C-complementary segments of adaptor primers, thus decreasing the price of the
procedure. Overhangs for locus-specific primers were selected to be 5 nt long
(see scheme below Table 1). Such a length was satisfactory for the purpose of
the present invention. However, longer overhangs showed better ligation
efficiency (data not presented). Moreover, for longer overhangs it is possible to
increase the ligation efficiency about two-fold by carrying out the reaction at
37°C, i.e. the optimal temperature for T4 DNA ligase.

The method of the invention requires attention to the selection of primer ends 
(termini) to avoid misligation. In accordance with the present invention it was
found that due to the high fidelity of T4 DNA ligase this restriction is not stringent.
To estimate the efficiency of mismatch ligation, we have compared ligation of 3'
overhangs (3'-CTCGG...5') with perfectly matched primers/oligonucleotides
(5'-...GAGCC-3') and with two mismatched primers/oligonucleotides:
(5'-...GAGga-3') and (5'-...GAGCg-3'). In experiments with a ten-fold excess of
ligase we have not detected any ligation products for overhangs containing one
and two mismatched bases. No problems with misligation were observed for
overhangs shown in Table 1. It is possible to improve the ligation accuracy by
increasing the concentration of monovalent ions in the reaction buffer (Wu and
Wallace 1989), and to exclude any participation of adaptor oligos in ligation by
blocking their 3' ends.

According to the current New England BioLabs Catalogue, 1 unit of T4 DNA
ligase is capable of joining about 1.2 pmol of nicks of λ HindIII-digested DNA in 1
hour. In a test ligation of #C with #R and #aR we have obtained a similar result:
1 u - 0.4 pmol. Because PEG 6000 increases the ligation efficiency (Pheiffer and
Zimmerman 1983), we have checked the influence of 5% - 20% PEG 6000 on
the reaction. 15% PEG turned out to be the most effective, increasing the ligation
efficiency about 15 times: 1 u - 6 pmol. For padlock synthesis, 0.5 unit of T4 DNA
ligase should be added per 1 pmol of the #C primer. This ratio was determined in
a series of small-scale ligations with decreasing amounts of the enzyme (data not
shown) and agrees well with titration on individual nicks.
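
The ratios stated in this example (adaptors at 1.1x, locus-specific primers at 1.2x, and 0.5 unit of T4 DNA ligase per pmol of the common primer) can be turned into a small reaction calculator. This is a sketch for illustration only; the function name, parameter names and dictionary keys are hypothetical.

```python
def ligation_setup(common_pmol: float,
                   adaptor_excess: float = 1.1,
                   specific_excess: float = 1.2,
                   ligase_u_per_pmol: float = 0.5) -> dict[str, float]:
    """Amounts for a padlock ligation, scaled to the common (#C) primer.

    The defaults are the ratios given in the text; the common primer is kept
    in molar deficit so that it is fully consumed.
    """
    if common_pmol <= 0:
        raise ValueError("common_pmol must be positive")
    return {
        "common_primer_pmol": round(common_pmol, 1),
        "each_adaptor_pmol": round(common_pmol * adaptor_excess, 1),
        "each_locus_specific_primer_pmol": round(common_pmol * specific_excess, 1),
        "t4_dna_ligase_units": round(common_pmol * ligase_u_per_pmol, 1),
    }


# The 200 pmol-scale reaction of the Methods section.
setup = ligation_setup(200)
```

For 200 pmol of the common primer this gives 220 pmol of each adaptor, 240 pmol of each locus-specific primer and 100 units of ligase, which matches the amounts listed in Materials and Methods.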

Padlock probes were purified in denaturing PAGE (Figure 2A). The yield is 
quantitative: 70% according to spectrophotometer measurements and close to 
100% according to PAGE (Figure 2A). It seems that some UV-absorbing
impurities are removed in this step. Purified padlock probes run as single bands
in denaturing PAGE (Figure 2A) and are suitable for SNP-discriminating ligation
(Figure 4A). PAGE purification is the most time-consuming and laborious step of
the procedure. However, in conventional phosphoramidite synthesis, this step
also cannot be avoided (see below). Besides, in ligation-based procedures the
distinct difference between the length of the padlock and the initial primers
(Figure 2A) allows HPLC purification to be used instead of PAGE. Adaptor primers
do not participate in ligation and may be repurified together with padlocks in
large-scale projects.

The ligation-based procedure is superior to conventional
phosphoramidite synthesis.

Length restriction. Oligonucleotide-producing companies cannot guarantee a
quantitative yield for oligos longer than 100 nt and do not take orders for primers
longer than 130 nt. In contrast, the ligation-based method of the present invention
is only restricted by the synthesis of individual components: for a 3-component
padlock probe the procedure is practical up to 150-180 nt.

Price and quality. According to information from all three companies listed in
Table 2, a two-step purification (HPLC and PAGE) is required for 80-100 nt long
oligonucleotides. HPLC alone cannot separate full-length oligonucleotides from
nearby contaminants. Smallest-scale synthesis of one 100 nt oligonucleotide
costs about 170 Euro and gives about 3 nmol of product after HPLC purification
(Table 2A). Judging from Operon, the yield after PAGE purification will be less
than 1 nmol.

A 5 nmol-scale synthesis of a single padlock oligonucleotide by the ligation method 
costs 120 Euro (Table 2B). In our hands the yield after PAGE purification is more 
than 3 nmol. Comparing "170 Euro per 1 nmol" with "120 Euro per 3 nmol", the 
ligation-based synthesis of the present invention is about four times cheaper. What is 
more important, the quality of ligase-based synthesis is higher. It provides bands 
practically without "n-1" contaminants. In principle, HPLC purification may be 
used instead of PAGE. 
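
The comparison above reduces to a price per nanomole of PAGE-purified product. A minimal sketch of that arithmetic in Python, using only the figures quoted in the text; the function name is illustrative, and the conventional yield is taken as 1 nmol, the upper bound stated above:

```python
def price_per_nmol(price_eur: float, yield_nmol: float) -> float:
    """Price of one nanomole of purified product, in Euro."""
    if yield_nmol <= 0:
        raise ValueError("yield must be positive")
    return price_eur / yield_nmol


# Figures quoted in the text. The conventional yield is an upper bound
# ("less than 1 nmol"), the ligation yield a lower bound ("more than 3 nmol").
conventional = price_per_nmol(170.0, 1.0)
ligation = price_per_nmol(120.0, 3.0)
ratio = conventional / ligation

print(f"conventional: {conventional:.0f} Euro per nmol")
print(f"ligation:     {ligation:.0f} Euro per nmol")
print(f"ratio:        {ratio:.2f}")
```

Because the conventional yield is below 1 nmol and the ligation yield is above 3 nmol, the computed ratio is a lower estimate of the saving.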

Synthesis of large sets. To produce a set of oligonucleotides with block 
structure by the phosphoramidite technology, the common regions have to be 
synthesized again and again. The price per oligo does not change compared 
with a single synthesis. For example, synthesis of 40 100 nt padlocks for 20 
SNP loci (two padlocks per locus) by the conventional procedure costs 6720 Euro 
(Table 2A). In the ligation-based method the price per oligonucleotide drops 
dramatically, because the common parts need to be synthesized only once. The 
same set costs 1846 Euro (Table 3). The price difference is about 10-fold (6720 Euro 
for 1 nmol of each padlock; 1846 Euro for 3 nmol of each). 
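
The figures for the set of 40 padlocks can be checked against the component prices of Table 3. The sketch below is illustrative Python: the variable names are not from the original, the unit price of 168 Euro is simply 6720 Euro divided by 40 oligonucleotides, and the yields (1 nmol and 3 nmol per padlock) are those quoted in the text.

```python
# Component prices in Euro for 20 SNP loci, as listed in Table 3.
table3_prices = {
    "locus-specific parts": 1172,
    "adaptors": 114,
    "common parts": 330,
    "T4 DNA ligase": 130,
    "T4 PNK": 100,
}
n_padlocks = 40

ligation_total = sum(table3_prices.values())
conventional_total = n_padlocks * 168  # 168 Euro per 100 nt oligonucleotide

# Price per nmol of each padlock, using the yields quoted in the text.
conventional_per_nmol = conventional_total / (n_padlocks * 1.0)
ligation_per_nmol = ligation_total / (n_padlocks * 3.0)
fold = conventional_per_nmol / ligation_per_nmol

print(f"ligation total:     {ligation_total} Euro")
print(f"conventional total: {conventional_total} Euro")
print(f"price per padlock:  {ligation_total / n_padlocks:.2f} Euro")
print(f"fold difference per nmol: {fold:.1f}")
```

The totals alone differ by a factor of about 3.6; the roughly 10-fold figure only appears once the three-fold higher yield per padlock is taken into account.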

If the common parts contain biotin or fluorescein, conventional synthesis will be 2000 
Euro more expensive (40 padlocks x 50 Euro) and the yield drops about two-fold. 
Ligation-based synthesis will cost 100 Euro more (2 padlocks x 50 Euro) and the 
yield remains the same. 

Two PCR-based methods of padlock synthesis were suggested recently (Antson 
et al. 2000; Myer and Day 2001; and Figure 2C), but they have some 
disadvantages compared with the proposed ligation-based technology. 
Locus-specific primers have long overhangs for PCR initiation (~12 nt (Antson et al. 
2000) and ~18 nt (Myer and Day 2001)). It is impossible to insert modified bases 
at a defined position during PCR. Use of a proofreading polymerase (Antson 
et al. 2000) results in heterogeneous 3' ends of padlocks. Finally, single-strand 
purification with streptavidin PMPs is expensive (about 1 ml of Dynabeads is 
required for 500 pmol of padlock). 

To summarise, the ligation-based technology in accordance with the present 
invention permits the parts of composite oligonucleotides to be synthesized in separate 
reactions of the appropriate scale. Waste is minimal. Adaptor primers may be 
reused. Synthesis may be automated, because it is compatible with HPLC-based 
purification. For large sets of composite oligonucleotides the price per primer 
almost equals the price of the locus-specific parts and becomes comparable 
with that of 40-50mers, which are now widely used in molecular biology. 

Example 2: Composite primers for PCR amplification. 

Composite primers with gene-specific 3' regions (Figure 1B) are used for 
multiplex PCR amplification (Favis et al. 2000; Lindblad-Toh et al. 2000; 
Eurogenetics - genome primer sets), ligase-independent cloning (LIC) and 
Invader assay (Mein et al. 2000). They have a "block structure": the 3' parts (17-25 nt) 
identify the gene target and the 5' regions define a particular application: 
second-stage PCR (16-20 nt), a concrete LIC scheme (LIC system, Novagen - 15 nt; 
In-Fusion PCR cloning, BD Biosciences Clontech - 16 nt; Gateway PCR cloning 
system, Invitrogen - 29 nt) and so on. 

Some applications require a combination of blocks. For example, the same 
gene-specific part should be attached to different vector-specific parts for optimization 
of protein expression (Dieckman et al. 2002). Frequently, the required 
amount of primers is very small. Only one successful PCR reaction is necessary 
for LIC or for preparation of genome-sequencing tags (Eurogenetics: 
http://www.eurogentec.com). 

Each composite primer has to be synthesized individually by conventional 
phosphoramidite technology. Moreover, decreasing the synthesis scale does 
not decrease the price of the primers proportionally. Some LIC methods require 
insertion of modified bases in composite oligonucleotides, for example 4-5 
dUracils (Rashtchian et al. 1992) or 2-3 phosphorothioate bonds (de Costa and 
Tanuri 1998; Zhou and Hatahet 1995). In this case the price of composite oligos 
increases 2-3 times. 

PCR amplification is more difficult with long composite primers. Sometimes the 
only way to obtain a single-band product is to use cloned or preliminarily amplified 
fragments as a template. This means that two sets of primers have to be used in 
this case: a gene-specific pair and a composite pair. 

It is very convenient to prepare small amounts of composite primers by attaching 
presynthesized blocks to each other. In this case it is possible to combine 
gene-specific parts with different 5' parts. 

A small-scale procedure for preparation of composite primers from individual 
blocks with the help of T4 RNA ligase was described previously (Kaluz et al. 
1995). This enzyme needs no overhangs for joining two oligos. One 
disadvantage of the method is that some part of the reaction products (depending 
on the excess of external primer) has a duplication of the gene-specific 3' part. This is not 
critical for some applications, but can cause problems for cloning. Another 
disadvantage is that phosphorylation and ligation have to be performed 
separately. 

In contrast to the above-recited prior art approaches, the (T4 DNA) ligase-based 
method of the present invention requires overhangs for ligation, but guarantees 
the homogeneity of the product. Ligated primers may be used in PCR without 
any purification because PEG and other components of the ligation mixture do not 
inhibit PCR. Ligation and phosphorylation may be performed simultaneously in a 
ligation buffer (result not shown). 

Adaptor primers do not interfere with PCR. To decrease the risk of false priming 
by adaptor primers and mis-annealing of long composite primers, it is most 
appropriate to use a mixture of external and composite primers (in a molar ratio of 
10:1) instead of composite primers alone in an amplification reaction. 
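
To illustrate the 10:1 molar ratio, the following Python sketch splits a total primer amount into its external and composite parts. The function name and the 22 pmol total are hypothetical, chosen only to give round numbers; the text itself specifies nothing beyond the ratio.

```python
def split_primer_mix(total_pmol: float,
                     external_parts: float = 10.0,
                     composite_parts: float = 1.0) -> tuple[float, float]:
    """Return (external_pmol, composite_pmol) for the given molar ratio."""
    parts = external_parts + composite_parts
    if total_pmol < 0 or external_parts < 0 or composite_parts < 0 or parts == 0:
        raise ValueError("amounts must be non-negative and the ratio non-empty")
    external = total_pmol * external_parts / parts
    composite = total_pmol * composite_parts / parts
    return external, composite


# Hypothetical example: 22 pmol of primer in total at a 10:1 ratio.
external_pmol, composite_pmol = split_primer_mix(22.0)
print(f"external primer:  {external_pmol:.1f} pmol")
print(f"composite primer: {composite_pmol:.1f} pmol")
```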

An example of amplification with composite primers produced by ligation is 
shown in Figure 4B. The ORF of phi 29 polymerase (Blanco and Salas 1996) 
was amplified from phage culture. In both cases the required products 
were obtained. PCR was slower with composite primers than with 
gene-specific ones. The worse kinetics depend at least partly on the external primers, 
because the amplification is more effective with the (#top and #bot) pair than with the (#1 
and #2) pair (results not shown). 

There are two possible schemes for the design of adaptor primers for ligation. One 
approach is to order an individual adaptor for each particular combination of 5' and 
3' parts. Adaptors should be 10-14 nt long (two 5-7 nt overlaps). They are cheaper 
than long composite primers. The price of external primers can be left out of 
account, because they may be used with a number of gene-specific primers. A 
20 pmol aliquot of HPLC-purified external primer costs less than 10 Euro-cent. 
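
An individual adaptor of this kind is the reverse complement of the junction between the two blocks. The Python sketch below assumes DNA sequences written 5' to 3' and containing only A, C, G and T; the function names and the default overlap of 6 nt are illustrative choices, not part of the described procedure. The example uses the #2 and #bot sequences of Table 1.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_adaptor(upstream: str, downstream: str,
                   up_overlap: int = 6, down_overlap: int = 6) -> str:
    """Adaptor bridging the 3' end of `upstream` and the 5' end of `downstream`."""
    if not (0 < up_overlap <= len(upstream)
            and 0 < down_overlap <= len(downstream)):
        raise ValueError("overlaps must be positive and fit within the blocks")
    # The junction is read 5' to 3' across the future ligation site.
    junction = upstream[-up_overlap:] + downstream[:down_overlap]
    return reverse_complement(junction)


external = "GATCCTCAGTGGTGGTGGTGGTGGTGCA"       # #2 (Table 1)
gene_specific = "TATGTTTGATTGTGAATGTGTCATCAAC"  # #bot (Table 1), upper-cased

adaptor_7_7 = design_adaptor(external, gene_specific, 7, 7)
adaptor_10_4 = design_adaptor(external, gene_specific, 10, 4)
print(adaptor_7_7)   # 14 nt adaptor with two 7 nt overlaps
print(adaptor_10_4)  # asymmetric overlaps of 10 and 4 nt
```

With overlaps of 10 and 4 nt the sketch returns CATATGCACCACCA, which is the sequence listed for adaptor #a2 in Table 1.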

Another approach is to use standard adaptors and to prepare gene-specific 
primers with predefined overhangs (as in the method for padlock preparation, 
see above). Long overhangs may conflict with some cloning schemes, but too 
short overhangs may be a bar to effective ligation. G/C-rich 5 nt overhangs 
(5'-...GAGCC-3') and (5'-ACGGG...-3') are sufficient for effective ligation (see 
padlock preparation above). To obtain an idea about the suitability of shorter 
sequences we have tried to use the (5'-TATG...-3') sequence for the preparation 
of composite primers. This sequence was selected because ATG may be 
combined with the first Met codon of the open reading frame. It turns out that a 4 nt 
A/T-rich overhang requires much more ligase. Moreover, different amounts of 
enzyme were necessary for different combinations of primers (Figure 3). 
Nevertheless, 1 µl (400 u) of T4 ligase was enough to prepare 20 pmol of 
composite primers. It costs about 1 Euro, much less than the price 
of long composite primers. 
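
The estimate of about 1 Euro can be reproduced from the ligase price given in Table 2B (1250 u for 3.25 Euro). A short Python check; the variable names are illustrative:

```python
# Ligase price from Table 2B and amounts quoted in the text.
ligase_price_eur = 3.25   # price of 1250 u of T4 DNA ligase
ligase_units = 1250
units_used = 400          # units of ligase used
primers_pmol = 20         # composite primers prepared

cost_eur = units_used * ligase_price_eur / ligase_units
cost_per_pmol = cost_eur / primers_pmol

print(f"ligase cost: {cost_eur:.2f} Euro")
print(f"per pmol of composite primer: {cost_per_pmol:.3f} Euro")
```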

Conclusion. 

Ligation-based technology is based on the construction of composite 
oligonucleotides from individual blocks. Here we have demonstrated how this 
method may be applied to two different tasks: 

(i) preparative-scale production of padlock probes; 
(ii) small-scale synthesis of PCR primers. 

The preparative-scale procedure gives products of high quality. In contrast to 
conventional phosphoramidite synthesis, HPLC may be used instead of PAGE 
purification. 

The small-scale procedure is a fast one-tube reaction. It does not require any 
purification and may be automated. 

In both cases the suggested method is considerably cheaper than 
conventional phosphoramidite synthesis. The price difference increases at 
least two-fold when the oligonucleotides contain modified bases. 

TABLES 

Table 1. Oligonucleotide sequences. 

SNP positions in template primers (T1 and T2) are in bold. Alignments for 
ligation-based synthesis of padlock and PCR primers are shown under the table. 
Overhangs for ligation are in bold. 

Name    Sequence
#C      GGAGGTTGCGAGGCGTATTCATTGCTCAGAATTCACGACTCACG
#aR     CCTCGCAACCTCCGGCTC
#aL     CCCGTCGTGAGTCGTGAATT
#R      TTGTAAAACGTCGGGAGAAACAGAGAGCC
#L      ACGGGACATTTAAGACCAAACTG
#T1     CTCTCTGTTTCTCCCGACGTTTTACAACAGTTTGGTCTTAAATGTTCGCCGC
#T2     TCTCTGTTTCTCCCGACGTTTTACAATAGTTTGGTCTTAAATGTTCGCCG

#top    TATGAAGCATATGCCGAGAAAGATG
#bot    tatgtttgattgtgaatgtgtcatcaac
#1      TTTTGtttaactttaagAAGGAGATATACA
#2      GATCCTCAGTGGTGGTGGTGGTGGTGCA
#a1     CATATGTATATCTCCTTCTTA
#a2     CATATGCACCACCA

Table 2. Synthesis of 100 nt primer. 

(A) Conventional method. 

Company   Synthesis scale   Price per base   Purification             Yield and price
Operon    1 µmol            2.05 Euro        HPLC & PAGE (118 Euro)   0.5 nmol, HPLC & PAGE, 323 Euro
MWG       >0.2 µmol         1.78 Euro        HPSF included            3 nmol, HPSF *, 178 Euro
TIB       >0.2 µmol         1.68 Euro        HPLC included            3 nmol, HPLC *, 168 Euro

* [...] laboratory. 

Operon: QIAGEN Operon GmbH, Cologne, Germany 
MWG: MWG Biotech, Ebersberg, Germany 
TIB: TIB MOLBIOL, Berlin, Germany 



(B) Ligation-based method (prices are given according to TIB). 

Component       Scale and purification                       Guaranteed yield and price
#L, #R          up to 30 bases long, HPLC purified           5 nmol; 39.08 Euro
#aL, #aR        up to 20 bases long, standard purification   5 nmol; 19.98 Euro
#C              up to 60 bases long, standard purification   5 nmol; 55.00 Euro
T4 DNA ligase   1250 u                                       3.25 Euro
T4 PNK          25 u                                         2.50 Euro
Total:                                                       5 nmol; 120 Euro

Table 3. Ligation-based synthesis of 40 100 nt padlocks for 20 SNP loci 
(two padlocks per one locus). 

Component          Scale and purification                       Price per 20 loci (Euro)
#L1, #L2, #R       5 nmol of each, HPLC purified                1172
#aL and #aR        500 nmol of each *, standard purification    114
#C1, #C2           250 nmol of each *, standard purification    330
T4 DNA ligase      50000 u                                      130
T4 PNK             1000 u                                       100
Total:                                                          1846 Euro
Per one padlock:   5 nmol                                       46.2 Euro

* Two times more than is required for 40 padlocks. 

CLAIMS 

1. A method of producing single-stranded nucleic acid molecules from oligo- 
or polynucleotides wherein each of said oligo- or polynucleotides has a 
predefined 5' or 3' terminus, comprising the steps of 

(a) annealing an adaptor oligonucleotide simultaneously or step by 
step to 

(aa) a first oligo- or polynucleotide; and 

(ab) a second oligo- or polynucleotide 

wherein the 5'-terminus of said adaptor oligonucleotide is 
complementary in sequence to the 5' terminus of said first oligo- or 
polynucleotide and the 3'-terminus of said adaptor molecule is 
complementary in sequence to the 3' terminus of said second oligo- 
or polynucleotide; and optionally 

(a') simultaneously with or subsequently to step (a) annealing at least 
one further adaptor oligonucleotide to free termini of said first or 
second oligonucleotides and to free termini of further oligo- or 
polynucleotides; 

(b) optionally filling in gaps between the neighbouring ends of said oligo- 
or polynucleotides; 

(c) ligating said oligo- or polynucleotides; and 

(d) removing said at least one adaptor oligonucleotide. 

2. The method of claim 1 wherein the complementarity in sequence is at 
least four nucleotides. 

3. The method of claim 1 or 2 wherein annealing and ligation are 
simultaneously performed. 

4. The method of any one of claims 1 to 3 wherein the adaptor 
oligonucleotide(s) in step (a) and/or (a') is/are provided in molar excess 
over the first or second or further oligo- or polynucleotides. 

5. The method of any one of claims 1 to 4 wherein said single-stranded 
nucleic acid molecules represent a collection of nucleic acid molecules and 
wherein either said first or said second oligo- or polynucleotide is invariable 
in sequence between all members of said collection of nucleic acid 
molecules. 

6. The method of claim 5 wherein said first or said second oligo- or 
polynucleotide which is not invariable is variable in sequence between 
different members of said collection of nucleic acid molecules. 

7. The method of claim 5 or 6 wherein the further oligo- or polynucleotides 
are variable in sequence between different members of said collection of 
nucleic acid molecules. 

8. The method of any one of claims 5 to 7 wherein the oligo- or 
polynucleotides representing said variable sequences are provided in 
molar excess over the nucleic acid molecule representing said invariable 
sequences. 

9. The method of any one of claims 5 to 8 wherein the 5' or 3' termini of said 
oligo- or polynucleotides representing said variable sequences which 
anneal to said 5' or 3' termini of said adaptor oligonucleotide are invariable 
between different members of said oligo- or polynucleotides representing 
said variable sequences. 

10. The method of any one of claims 1 to 9 wherein ligation is effected with 
T4 DNA ligase. 

11. The method of any one of claims 1 to 10 wherein the ligation reaction is 
carried out in the presence of at least 5% polyethylene glycol. 

12. The method of claim 8 wherein the ligation reaction is carried out in the 
presence of about 15% polyethylene glycol. 

13. The method of claim 11 or 12 wherein said polyethylene glycol is 
polyethylene glycol 6000. 

14. The method of any one of claims 1 to 11 wherein about 1 unit of T4 DNA 
ligase is reacted in step (c) with about 4 pmol of termini of the oligo- or 
polynucleotides annealed to said adaptor molecule(s). 

15. The method of any one of claims 1 to 12 further comprising the step of 
purifying said single-stranded nucleic acid molecules. 

16. The method of claim 15 wherein purification includes PAGE 
electrophoresis, HPLC or chromatography. 

17. The method of any one of claims 1 to 16 further comprising modifying at 
least one of said oligo- or polynucleotides. 

18. The method of any one of claims 1 to 16 wherein at least one of said oligo- 
or polynucleotides is modified. 

19. The method of claim 17 or 18 wherein the modification is a ribonucleotide, 
a spacer or a nucleotide comprising a detectable label. 

20. The method of any one of claims 17 to 19 wherein said oligo- or 
polynucleotides representing the invariable sequence are modified. 

21. The method of any one of claims 5 to 20 further comprising employing 
members of said collection of nucleic acid molecules in the determination 
of SNPs in vitro. 

22. The method of any one of claims 5 to 20 further comprising employing 
members of said collection of nucleic acid molecules in ligase-independent 
cloning or two-step PCR. 

ABSTRACT 



The present invention relates to a method of producing single-stranded nucleic 
acid molecules from oligo- or polynucleotides wherein each of said oligo- or 
polynucleotides has a predefined 5' or 3' terminus, comprising the steps of (a) 
annealing an adaptor oligonucleotide simultaneously or step by step to (aa) a 
first oligo- or polynucleotide; and (ab) a second oligo- or polynucleotide 
wherein the 5'-terminus of said adaptor oligonucleotide is complementary in 
sequence to the 5' terminus of said first oligo- or polynucleotide and the 
3'-terminus of said adaptor molecule is complementary in sequence to the 3' 
terminus of said second oligo- or polynucleotide; and optionally (a') 
simultaneously with or subsequently to step (a) annealing at least one further 
adaptor oligonucleotide to free termini of said first or second oligonucleotides 
and to free termini of further oligo- or polynucleotides; (b) optionally filling in 
gaps between the neighbouring ends of said oligo- or polynucleotides; (c) 
ligating said oligo- or polynucleotides; and (d) removing said at least one 
adaptor oligonucleotide. In a preferred embodiment of the method of the 
invention, said single-stranded nucleic acid molecules represent a collection of 
nucleic acid molecules wherein either said first or said second oligo- or 
polynucleotide is invariable in sequence between all members of said collection 
of nucleic acid molecules. 